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SUMMARY 


The  theory  of  extreme  values  Is  shown 
to  be  revelant  In  the  analysis  of  two 
stochastic  biological  phenomena. 
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"Bacterial  Extinction  Time  as  an  Extreme  Value  Phenomenon" 

by 

Benjamin  Epstein 

In  [3],  Mather  found  that  some  data  on  bacterial  extinction  times  in  the 

presence  of  a  bactericide  appeared  to  fall  along  a  straight  line  when  one  plotted 

log(-log  pt)  against  t  ,  where  pt  is  the  observed  proportion  of  samples  with 

no  surviving  bacteria  after  an  exposure  of  t  minutes  to  the  bactericide. 

In  Mather's  data,  pt  was  observed  in  samples  of  size  18  at  times  t  ■  12(2)26 

minutes,  and  p^  varied  from  zero  at  t  *  12  to  4  at  t  *  26.  Mather  was 
t  18 

led  by  heuristic  reasoning  to  the  model  log (-log  p^)  ■  a  +  Bt,  where  pt  is 

the  "true"  proportion  of  samples  showing  the  absence  of  bacteria  after  an  exposure 

of  time  t  to  the  bactericide.  Subsequently,  it  was  noted  by  Gumbel  on  pages 

43-44  of  [3]  that  Mather  had  given  an  interesting  application  of  the  theory  of 

-e*a  +  ^ 

extreme  values  without,  however,  using  the  term,  since  pt  •  e 

is  the  type  1  asymptotic  distribution  of  largest  values.  Gumbel  then  gave  an 
alternative  method  of  fitting  the  experimental  data,  which  he  deemed  somewhat  more 
appropriate  in  the  light  of  extreme  value  theory.  Actually,  there  appears  to  be 
little  to  choose  between  the  two  methods  of  fitting  the  data,  since,  in  this 
instance,  the  parameter  estimates  obtained  by  either  method  are  in  excellent 
agreement  with  each  other. 

Neither  Gumbel  nor  Mather  gave  a  theoretical  model  for  the  time  to  extinction 
problem.  To  provide  such  a  model  would  seem  to  be  of  some  interest,  particularly 
in  view  of  the  last  paragraph  on  page  44  or  [2]:  "The  three  figures  show  that  the 
theory  of  largest  values  in  an  adequate  tool  for  the  analysis  of  the  extinction  time 
of  bacteria  subject  to  a  disinfectant.  However,  the  philosophical  question  of  why 
these  data  follow  the  law  of  largest  values  Instead  of  smallest,  as  one  might  expect, 
remains  open." 
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We  now  give  a  simple  model  for  the  bacterial  extinction  time  problem  which 
leads  In  a  natural  way  to  the  type  1  distribution  of  largest  values.  In  the 
model  we  make  the  following  assumptions: 

a)  the  nunber  of  bacteria  in  a  sample  just  prior  to  be  the  introduction  of 
a  bactericide  at  time  t  *  0  is  a  Poisson  random  variable*  with 

mean  yQ  ; 

b)  the  "lifetime"  of  a  bacterium  in  the  presence  of  the  bactericide 

(a  fixed  dose  is  assumed)  is  distributed  with  c.d.f  F(t)  ,  t  ^0  , 

It  is  clear  that  the  presence  or  absence  of  bacteria  in  the  sample  after 
a  length  of  time  t  has  elapsed  depends  upon  whether  or  not  all  of  the  bacteria, 
x  ,  present  in  the  sample  at  time  t  ■  0  have  "died"  or,  putting  it  another  way, 
whether  or  not  the  longest  lived  bacterium  survives.  Since 


p(x) 


is  the  probability  that  x  bacteria  are  in  the  sample  at  time  t  -  0  and 
[F (t) ] X  is  the  probability  that  all  x  bacteria  "die"  by  time  t  (assuming  that 
bacterial  extinction  times  are  mutually  independent)  it  follows  from  the  theorem 
of  total  probability  that  G(t),  the  c.d.f  of  the  time  T  to  total  bacterial 
extinction  in  the  sample,  is  given  by** 

oo  g  ^O^X 

G(t)  -  Pr(T<t)  -  l  — r -  [F(t)]X 

x-0  X* 

(1)  -Y0U  -  F(t)  1 

-  e  ,  t  >  0  . 

*  If  the  sample  consists  of  v  cc  drawn  from  a  solution  in  which  bacteria  are 
distributed  at  random  with  6  bacteria  per  unit  volume,  than  yq  •  6  v  . 

ft  A  simpler  proof  of  (1)  is  as  follows:  If  xq  ,  the  number  of  bacteria  in  the 

sample  at  time  t  »  0  ,  is  Poisson  distributed  with  mean  X^  ,  the  number  of 

bacteria  living  longer  than  time  t  ,  is  Poisson  distributed  with  mean 

Y0(l  -  F(t) ]  .  Consequently,  the  probability  that  the  sample  will  be  bacteria- 

free  is  Pr(xt  -  0)  -  e^O*1  ”  .  But  the  events  T  t  and  x£  -  0  are 

equivalent  and  this  establishes  (1). 


-yQ 

A  discrete  amount  of  probability  e  ,  which  corresponds  to  the  probability 

that  there  are  no  bacteria  in  the  sample  at  time  t  ■  0  ,  is  assigned  to  the 

+  “Y0 

c.d.f  G(t)  at  t  -  0  ,  i.e.,  G(0  )  -  e  u  . 

If  bacterium  lives  are  distributed  with  c.d.f  F(t)  -  1  -  e"X£  ,  t  >  0  ,  an 
assumption  that  would  appear  to  be  reasonable  in  this  situation,  G(t)  becomes 


-Y  e"Xt 

(2)  G(t)  -  e  0  ,  t  >  0 

The  c.d.f  (2)  is  called  the  type  1  distribution  of  largest  values  in  Gumbel's 
terminology.  It  follows  from  (2)  that 

(3)  log (-log  G(t) )  -  log  Yq  -  Xt  . 


This  is  a  theoretical  justification  of  the  model  used  to  fit  the  date  in  [3]. 

A  useful  measure  of  the  effectiveness  of  the  bactericide  is  t  ,  the  modal 
or  most  probable  time  at  which  bacterial  extinction  takes  place,  t  ia  the 
solution  of  the  equation  g'(t)  -  0  ,  where  g(t)  ,  the  p.d.f  of  the  time  to 
extinction,  is  given  by 


(A) 


’Y0e 

g(t)  -  G'(t)  -  yQ\e  e 


-Xt 


,  t  >  0  . 


It  is  readily  verified  that  the  maximum  value  of  g(t)  is  attained  at 

log  Y0 


(5) 


t  - 


The  expected  number  of  bacteria  surviving  In  s  sample  after  exposure  to  the 

bactericide  for  a  length  of  time  t  is  Yq«  Xt  •  1  .  Similarly,  G(t)  •  e~*  . 

Thus,  t  is  that  exposure  time  such  that,  on  the  average,  one  bacterium  In  the 

t 

sample  still  survives  and  it  is  also  that  exposure  time  for  which  37%  of  the 
samples  become  bacteria  free.  As  pointed  out  by  Mather,  t  is  a  simpler  measure 
of  the  time  to  extinction  in  a  sample  than  the  more 

*  This  is  not  to  be  confused  with  the  probability  of  death  of  a  single  bacterium 
in  the  presence  of  the  bactericide.  If  the  time  to  extinction  distribution  of 

T  *  i 

individual  bacteria  is  given  by  F(t)  •  1  -  e  Xt,  then  F(t)  •  1  -  —  •  — 

Y0  Y0  - 

This  the  probability  that  an  individual  bacterium  will  die  on  or  before  time  t 
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common  t  ,  the  time  at  which  50%  of  the  samples  become  bacteria  free,  t 
is  related  to  t  by  the  formula 

(6)  t  „  -  £  -  ±2S-i2S-2 

.50  X 


-Xt 

-YQe  .50 

as  can  easily  be  seen  from  the  relations  e  -  1/2  .  More  generally, 

t  ,  the  time  at  which  lOOpX  of  the  samples  become  bacteria  free,  is  related 


to  t  by  the  formula 
(7) 


t  -  t  - 
P 


log  log  ~ 


It  is  also  useful  to  point  out  that  if  the  sample  drawn  from  the  solution  to 
which  bactericide  had  been  added  contained  kv  c.c.  rather  than  v  c.c.,  then 
the  modal  extinction  time  t^  for  samples  of  this  size  would  be 

(8) 


Our  discussion  has  been  purely  theoretical  under  the  assumptions  that  the 
model  resulting  in  equation  (2)  is  a  reasonable  description  of  the  bacteriological 
situation.  In  practice,  of  course,  y q  and  X  may  be  unknown  and  would  be 
estimated  either  graphically  or  analytically  by  the  methods  described  by  Mather 
and  Gumbel.  The  value  of  the  model  is  that  it  assigns  a  meaning  to  each  of  the 

parameters.  For  the  data  in  [3],  Yq  and  X  can  be  estimated  respectively 

*  * 

(see  pages  140-141  of  (3])  as  y  *  794  and  X  -  .376.  Thus  we  can  say  from 

the  data  that  the  number  of  bacteria  in  the  sample  at  time  t  »  0  is  Poisson 

distributed  with  estimatea  mean  «  794  and  that  in  the  presence  of  1.15%  Phenol 

the  life  of  an  individual  bacterium  is  exponentially  distributed  with  estimated 

mean  life  ■  2.66  minutes.  Substituting  in  (5),  (6)  and  (7),  one  obtains 
X 

17.75,  18.73,  and  25.65  minutes  respectively  as  estimates  of  t  ,  t  ^  and  t  ^ 
Substituting  in  (8),  one  obtains  17.75  +  6.*2  ■  23.87  minutes  as  an  estimate  of 
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the  modal  extinction  time  in  a  10v  cc  sample. 

Comparisons  between  the  effectiveness  of  the  same  bactericide  at  various 
dosages  or  between  different  bactericides  at  the  same  dosage,  can  be  made  by 
comparing  the  corresponding  X  values. 

The  model  for  bacterial  extinction  times  given  in  this  paper  leads  to  the 
type  1  distribution  of  largest  values.  It  is  interesting  to  note  that  the  model 
given  recently  by  Gart  [1]  for  the  distribution  of  response  times  after  inoculating 
a  host  with  a  solution  containing  some  microorganisms  can  also  be  considered  in 
the  framework  of  extreme  value  theory,  the  only  difference  being  that  response 
times  are  distributed  as  smallest  values.  The  key  points  in  the  model  are:  1)  the 
number  of  inoculating  particles  in  the  sample  is  a  Poisson  random  variable  with 
mean  y  ,  2)  the  Incubation  time  for  a  single  particle  is  assumed  to  be  a  random 
variable  with  c.d.f  F(t),  3)  a  positive  response  occurs  on  or  before  time  t  if 
and  only  if  at  least  one  of  the  x  inoculating  particles  present  at  time  t  *  0 
has  incubated,  or  putting  it  another  way,  if  and  only  if  the  particle  taking  the 
shortest  time  to  incubate  has  done  so  by  time  t  .  It  follows  readily  from  these 

r«  /  *  \ 

assumptions  that  the  c.d.f  of  the  response  time  is  H(t)  ■  1  -  e  ,  t  >_  0  . 

— Xt  N 

If  one  lets  the  incubation  time  c.d.f  for  a  simple  particle  be  F(t)  •  [1  -  e  ] 
as  is  done  in  (1],  then  it  can  be  shown  that  for  large  y  ,  H(t)  can  be 
approximated  by  the  c.d.f  1  -  e  Y  .a  type  3  distribution  of  smallest  values. 
From  this  it  follows  that  there  is  an  approximately  linear  relationship  between  the 
logarithm  of  the  modal  response  time  (or,  store  generally,  the  logarithm  of  any 
percentile  of  the  response  of  the  response  time)  and  the  logarithm  of  y  . 
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